On the categorization of Cause and Effect in WordNet

نویسندگان

  • Cristina Butnariu
  • Tony Veale
چکیده

The task of detecting causal connections in text would benefit greatly from a comprehensive representation of Cause and Effect in WordNet, since previous studies show that semantic abstractions play an important role in the linguistic detection of semantic relations, in particular the cause-effect relation. Based on these studies on causality, and on our own general intuitions about causality, we propose a cover-set of different WordNet categories to represent the ontological classes of Cause and Effect. We also propose a corpus-based approach to the population of these categories, whereby candidate words and senses are identified in a large corpus (such as the Google N-gram corpus) using specific syntagmatic patterns. We describe experiments using the CauseEffect dataset from the 2007 SemEval workshop to evaluate the most effective combinations of WordNet categories and corpus data. Ultimately, we propose extending the WordNet category of Causal-Agent with the word-senses identified by this experimental exploration.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

The Use of WordNets for Multilingual Text Categorization: A Comparative Study

The successful use of the Princeton WordNet for Text Categorization has prompted the creation of similar WordNets in other languages as well. This paper focuses on a comparative study between two WordNet based approaches for Multilingual Text Categorization. The first relates on using machine translation to access directly the princeton WordNet while the second avoids machine translation by usi...

متن کامل

Punjabi WordNet Relations and Categorization of Synsets

This paper describes an attempt to develop Punjabi WordNet by using expansion approach from Hindi WordNet under Indradhanush WordNet Project. The origin, symbols, morphological and syntactic characteristics of Punjabi Language are presented in this paper. The lexical semantic relations used in Punjabi WordNet are elaborated. The need for synset categorization and the results of this categorizat...

متن کامل

A New WordNet Enriched Content-Collaborative Recommender System

The recommender systems are models that are to predict the potential interests of users among a number of items. These systems are widespread and they have many applications in real-world. These systems are generally based on one of two structural types: collaborative filtering and content filtering. There are some systems which are based on both of them. These systems are named hybrid recommen...

متن کامل

Integrating a Lexical Database and a Training Collection for Text Categorization

Automatic text categorization is a complex and useful task for many natural language processing applications. Recent approaches to text categorization focus more on algorithms than on resources involved in this operation. In contrast to this trend, we present an approach based on the integration of widely available resources as lexical databases and training collections to overcome current limi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007